TC TAGACTGGAC AGC ATC CAC AAGAGAAGC ACC TAGAAGGAGAATTTTC CC C AGC AGC TTGC TC AGGAC C C 
TGCAGGAGCCGCAGCTGGGACTGGACCTGCTGTTAACCATGAACTCTTCCTGCTGCCTGTCTTCTGTTTCT 
CCGATGCTGCCTAACCTCTCTGAGCACCCTGCAGCCCCTCCTGCCAGCAACCGGAGCGGCAGTGGGTTCTG 
TGAGCAGGTCTTCATCAAGCCGGAGGTCTTCCTGGCTCTGGGCATCGTCAGTCTGATGGAAAACATCCTGG 
TGATCCTGGCTGTGGTCAGGAATGGCAACCTGCACTCTCCCATGTACTTCTTCCTGTGCAGCCTGGCTGCA 
GCCGACATGCTGGTGAGCCTGTCCAACTCCCTGGAGACCATCATGATCGCCGTGATCAACAGCGACTCCCT 
GACCTTGGAGGACCAGTTTATCCAGCACATGGATAATATCTTCGACTCTATGATTTGCATCTCCCTGGTGG 
CCTCCATCTGCAACCTCCTGGCCATTGCCATCGACAGGTACGTCACCATCTTCTATGCCCTTCGGTACCAC 
AGCATCATGACAGTTAGGAAAGCCCTCACCTTGATCGGGGTCATCTGGGTCTGCTGCGGCATCTGCGGCGT 
GATGTTCATCATCTACTCCGAGAGCAAGATGGTCATCGTGTGTCTCATCACCATGTTCTTCGCCATGGTGC 
TCCTCATGGGCACCCTATATATCCACATGTTCCTCTTCGCCAGGCTCCACGTCCAGCGCATCGCAGTGCTG 
CCCCCTGCTGGCGTGGTGGCCCCACAGCAGCACTCCTGCATGAAGGGGGCTGTCACCATCACTATCCTGCT 
GGGTGTTTTCATCTTCTGCTGGGCGCCTTTCTTCCTCCACCTGGTCCTCATCATCACCTGCCCCACCAATC 
CCTACTGCATCTGCTACACGGCCCATTTCAACACCTACCTGGTTCTCATCATGTGCAACTCCGTCATCGAC 
CCCCTCATCTACGCCTTCCGCAGCCTGGAGCTGCGCAACACGTTCAAGGAGATTCTCTGCGGCTGCAACAG 
CATGAACTTGGGCTAGGATGCCCGTGGAGGTGTTCCACATCCAGCCAAGAGACAAAAACAACGCTCAGACG 
GGAC GT AAAAGGGTGTT AGG AGC TGG AAC TGTGC TTGGC TTC GTC TGTAAGC TCGTGGC CC TTTGC AG AC G 
GGACACGGCGTAGGATGGGCTGTCTGTGAGGATCTGTGTGTGGGTAAGTCAGTTTGATCTAGCACATAGCC 
TGGAAGAATCAGGCAAAGCAGCCCTGAGTGTCATCTGTGTTCATTGCTAGGCACCCAGGGTTTGTGGCCCC 
TGCCTGCTTATTGGCTTTGTACCAGTAACTGTGCTTCAAGCCAACCAGACCGGAGGGCTCTCGTGAGCAGA 
AAGAGTGCTTAGACTTCCGGCAAGCATCCTGGCTCACAGCGGCCACCTCCTGACCACTACCGGGAGAGCTT 
TGCACATATTCTGTGGGAGATTGAGTGAAGCCCTGAAAACAATGTGATATTTGCTGCTCCCTTCCAGAACT 
TACATCTGTGCCAGCCTCCCCGAACCCCTGCACAGAGACATGACCCCCTTCTCCCTGTGCCGTTGTCATGG 
TTGTTATTATTGTTGGAGTTTTGTTCGTTAAAATCTAAGCTT (SEQ ID N0:1) 



jynSFSSCCLSSVSPMLPNLSEHPAAPPASNRSGSGFCEQVFIKPEVFLALGIVSLMENILVILAWRJSTGNLHS 
PMYFFLCSLAAADMLVSLSNSLETIMI AVINSDSLTLEDQF IQHMDNI FDSMIC I SLVAS ICNLLAI AIDR 
WTIFYALRYHSIMTVRKALTLIGVIWCCGICGVMFIIYSESKWIVCLITMFFAM\n^LMGTLYIHMFL^ 
ARLHVQRIAVLPPAGWAPQQHSCMKGAVTITILLGVFIFCWAPFFLHLYLIITCPTNPYCICYTAHFNTY 
LVLIMCNSVIDPLIYAFRSLELRNTFKEILCGCNSMNLG (SEQ ID N0:2) 



FIGURE 1 



Underlined = deleted in targeting construct 

Bold = sequence flanking Neo insert in targeting construct 



TCTAGACTGGACAGCATCCACAAGAGAAGCACCTAGAAGGAGAATTTTCCCCAGCAGCTT 
GCTCAGGACCCTGCAGGAGCCGCAGCTGGGACTGGACCTGCTGTTAACCATGAACTCTTC 
CTGCTGCCTGTCTTCTGTTTCTCCGATGCTGCCTAACCTCTCTGAGCACCCTGCAGCCCC 
TCCTGCCAGCAACCGGAGCGGCAGTGGGTTCTGTGAGCAGGTCTTCATCAAGCCGGAGGT 
CTTCCTGGCTCTGGGCATCGTCAGTCTGATGGAAAACATCCTGGTGATCCTGGCTGTGGT 
CAGGAATGGCAACCTGCACTCTCCCATGTACTTCTTCCTGTGCAGCCTGGCTGCAGCCGA 
CATGCTGGTGAGCCTGTCCAACTCCCTGGAGACCATCATGATCGCCGTGATCAACAGCGA 
CTCCCTGACCTTGGAGGACCAGTTTATCCAGCACATGGATAATATCTTCGACTCTATGAT 
TTGCATCTCCCTGGTGGCCTCCATCTGCAACCTCCTGGCC ATTGCCATCGACAGGTACGT 
CACCATCTTCTATGCCCTTCGGTACCACAGCATCATGACAGTTAGGAAAGCCCTCACCTT 
GATCGGGGTCATCTGGGTCTGCTGCGGCATCTGCGGCGTGATGTTCATCATCTACTCCGA 
GAGCAAGATGGTCATCGTGTGTCTCATCACCATGTTCTTCGCCATGGTGCTCCTCATGGG 
CACCCTATATATCCACATGTTCCTCTTCGCCAGGCTCCACGTCCAGCGCATCGCAGTGCT 
GCCCCCTGCTGGCGTGGTGGCCCCACAGCAGCACTCCTGCATGAAGGGGGCTGTCACCAT 
CACTATCCTGCTGGGTGTTTTCATCTTCTGCTGGGCGCCTTTCTTCCTCCACCTGGTCCT 
CATCATCACCTGCCCCACCAATCCCTACTGCATCTGCTACACGGCCCATTTCAACACCTA 
CCTGGTTCTCATCATGTGCAACTCCGTCATCGACCCCCTCATCTACGCCTTCCGCAGCCT 
GGAGCTGCGCAACACGTTCAAGGAGATTCTCTGCGGCTGCAACAGCATGAACTTGGGCTA 
GGATGCCCGTGGAGGTGTTCCACATCCAGCCAAGAGACAAAAACAACGCTCAGACGGGAC 
GTAAAAGGGTGTTAGGAGCTGGAACTGTGCTTGGCTTCGTCTGTAAGCTCGTGGCCCTTT 
GCAGACGGGACACGGCGTAGGATGGGCTGTCTGTGAGGATCTGTGTGTGGGTAAGTCAGT 
TTGATCTAGCACATAGCCTGGAAGAATCAGGCAAAGCAGCCCTGAGTGTCATCTGTGTTC 
ATTGCTAGGCACCCAGGGTTTGTGGCCCCTGCCTGCTTATTGGCTTTGTACCAGTAACTG 
TGCTTCAAGCCAACCAGACCGGAGGGCTCTCGTGAGCAGAAAGAGTGCTTAGACTTCCGG 
C AAGC ATC C TGGC TC AC AGCGGC C AC C TC C TGAC C AC T AC C GGG AG AGC TTTGC AC AT AT 
TCTGTGGGAGATTGAGTGAAGCCCTGAAAACAATGTGATATTTGCTGCTCCCTTCCAGAA 
CTTACATCTGTGCCAGCCTCCCCGAACCCCTGCACAGAGACATGACCCCCTTCTCCCTGT 
GCCGTTGTCATGGTTGTTATTATTGTTGGAGTTTTGTTCGTTAAAATCTAAGCTT 



FIGURE 2A 



Gene Sequence 
Structure * 



296 bp 



Sequence Deleted 



519 bp 



Size of CDS: 1675 bp 




Targeting Vector* (genomic sequence) 
Construct Number: 2787 



Arm Length: 
5': 1.6 kb 
3': 4 kb 



Endogenous Locus 



* Not drawn to scale 



LacZ-Neo 

/.Gazette - 




5 1 >ACCTGCTGTTAACCATGAACT 

CTTCCTGCTGCCTGTCTTCTGTTT 

CTCCGATGCTGCCTAACCTCTCTG 

AGCACCCTGCAGCCCCTCCTGCCA 

GCAACCGGAGCGGCAGTGGGTTCT 

GTGAGCAGGTCTTCATCAAGCCGG 

AGGTCTTCCTGGCTCTGGGCATCG 

TCAGTCTGATGGAAAACATCCTGG 

TGATCCTGGCT<3 1 

(SEQ ID NO:3) 



5 ■ >CATTGCCATCGACAGGTACGT | 
CACCATCTTCTATGCCCTTCGGTA | 
CCACAGCATCATGACAGTGAGGAA j 
AGCCCTCACCTTGATCGGGGTCAT 
CTGGGTCTGCTGCGGCATCTGCGG 
CGTGATGTTCATCATCTACTCCGA 
GAGC AAGATGGTC ATC GTGTGTCT 
CATC AC C ATGTTC TTC GC C ATGGT 
GCTCCTCATGG<3 1 
(SEQ ID NO:4) 



FIGURE 2B 



